Moment (mathematics)

Significance of the moments

The n^th moment of a real-valued continuous function f(x) of a real variable about a value c is

$\mu'_n=\int_{-\infty}^\infty (x - c)^n\,f(x)\,dx.\,\!$

It is possible to define moments for random variables in a more general fashion than moments for real values—see moments in metric spaces. The moment of a function, without further explanation, usually refers to the above expression with c = 0.

Usually, except in the special context of the problem of moments, the function f(x) will be a probability density function. The n^th moment about zero of a probability density function f(x) is the expected value of Xⁿ and is called a raw moment or crude moment^[2]. The moments about its mean μ are called central moments; these describe the shape of the function, independently of translation.

If f is a probability density function, then the value of the integral above is called the nth moment of the probability distribution. More generally, if F is a cumulative probability distribution function of any probability distribution, which may not have a density function, then the nth moment of the probability distribution is given by the Riemann-Stieltjes integral

$\mu'_n = \operatorname{E}(X^n)=\int_{-\infty}^\infty x^n\,dF(x)\,$

where X is a random variable that has this distribution and E the expectation operator or mean.

When

$\operatorname{E}(|X^n|) = \int_{-\infty}^\infty |x^n|\,dF(x) = \infty,\,$

then the moment is said not to exist. If the nth moment about any point exists, so does (n − 1)th moment, and all lower-order moments, about every point.

Variance

The second central moment is the variance, the positive square root of which is the standard deviation, σ.

Normalized moments

The normalized nth central moment or standardized moment is the nth central moment divided by σⁿ; the normalized nth central moment of x = E((x − μ)ⁿ)/σⁿ. These normalized central moments are dimensionless quantities, which represent the distribution independently of any linear change of scale.

Skewness

The third central moment is a measure of the lopsidedness of the distribution; any symmetric distribution will have a third central moment, if defined, of zero. The normalized third central moment is called the skewness, often γ. A distribution that is skewed to the left (the tail of the distribution is heavier on the left) will have a negative skewness. A distribution that is skewed to the right (the tail of the distribution is heavier on the right), will have a positive skewness.

For distributions that are not too different from the normal distribution, the median will be somewhere near μ − γσ/6; the mode about μ − γσ/2.

Kurtosis

The fourth central moment is a measure of whether the distribution is tall and skinny or short and squat, compared to the normal distribution of the same variance. Since it is the expectation of a fourth power, the fourth central moment, where defined, is always non-negative; and except for a point distribution, it is always strictly positive. The fourth central moment of a normal distribution is 3σ⁴.

The kurtosis κ is defined to be the normalized fourth central moment minus 3. (Equivalently, as in the next section, it is the fourth cumulant divided by the square of the variance.) Some authorities^[3]^[4] do not subtract three, but it is usually more convenient to have the normal distribution at the origin of coordinates. If a distribution has a peak at the mean and long tails, the fourth moment will be high and the kurtosis positive (platykurtic); and conversely; thus, bounded distributions tend to have low kurtosis (leptokurtic).

The kurtosis can be positive without limit, but κ must be greater than or equal to γ² − 2; equality only holds for binary distributions. For unbounded skew distributions not too far from normal, κ tends to be somewhere in the area of γ² and 2γ².

The inequality can be proven by considering

$\operatorname{E} ((T^2 - aT)^2)\,$

where T = (X − μ)/σ. This is the expectation of a square, so it is non-negative whatever a is; on the other hand, it's also a quadratic equation in a. Its discriminant must be non-positive, which gives the required relationship.

Mixed moments

Mixed moments are moments involving multiple variables.

Some examples are covariance, co-skewness and co-kurtosis. While there is a unique covariance, there are multiple co-skewnesses and co-kurtoses.

Higher moments

High-order moments are moments beyond 4th-order moments. The higher the moment, the harder it is to estimate, in the sense that larger samples are required in order to obtain estimates of similar quality.

Cumulants

The first moment and the second and third unnormalized central moments are additive in the sense that if X and Y are independent random variables then

$\mu_1(X+Y)=\mu_1(X)+\mu_1(Y)\,$

and

$\operatorname{Var}(X+Y)=\operatorname{Var}(X) + \operatorname{Var}(Y)$

and

$\mu_3(X+Y)=\mu_3(X)+\mu_3(Y).\,$

(These can also hold for variables that satisfy weaker conditions than independence. The first always holds; if the second holds, the variables are called uncorrelated).

In fact, these are the first three cumulants and all cumulants share this additivity property.

Sample moments

The moments of a population can be estimated using the sample k-th moment

$\frac{1}{n}\sum_{i = 1}^{n} X^k_i\,\!$

applied to a sample X₁,X₂,..., X_n drawn from the population.

It can be shown that the expected value of the sample moment is equal to the k-th moment of the population, if that moment exists, for any sample size n. It is thus an unbiased estimator.

Problem of moments

The problem of moments seeks characterizations of sequences { μ′_n : n = 1, 2, 3, ... } that are sequences of moments of some function f.

Partial moments

Partial moments are sometimes referred to as "one-sided moments." The nth order lower and upper partial moments with respect to a reference point r may be expressed as

$\mu_n^-(r)=\int_{-\infty}^r (r - x)^n\,f(x)\,dx,$

$\mu_n^+(r)=\int_r^\infty (x - r)^n\,f(x)\,dx.$

Partial moments are normalized by being raised to the power 1/n. The upside potential ratio may be expressed as a ratio of a first-order upper partial moment to a normalized second-order lower partial moment.

Moments in metric spaces

Let (M, d) be a metric space, and let B(M) be the Borel σ-algebra on M, the σ-algebra generated by the d-open subsets of M. (For technical reasons, it is also convenient to assume that M is a separable space with respect to the metric d.) Let 1 ≤ p ≤ +∞.

The p^th moment of a measure μ on the measurable space (M, B(M)) about a given point x₀ in M is defined to be

$\int_{M} d(x, x_{0})^{p} \, \mathrm{d} \mu (x).$

μ is said to have finite p^th moment if the p^th moment of μ about x₀ is finite for some x₀ ∈ M.

This terminology for measures carries over to random variables in the usual way: if (Ω, Σ, P) is a probability space and X : Ω → M is a random variable, then the p^th moment of X about x₀ ∈ M is defined to be

$\int_{M} d (x, x_{0})^{p} \, \mathrm{d} \left( X_{*} (\mathbf{P}) \right) (x) \equiv \int_{\Omega} d (X(\omega), x_{0})^{p} \, \mathrm{d} \mathbf{P} (\omega),$

and X has finite p^th moment if the p^th moment of X about x₀ is finite for some x₀ ∈ M.

External links

↑ A function such as a probability density function or cumulative distribution function; see Moment-generating function.
↑ http://mathworld.wolfram.com/RawMoment.html Raw Moments at Math-world
↑ Casella, George; Berger, Roger L. (2002). Statistical Inference (2 ed.). Pacific Grove: Duxbury. ISBN 0534243126.
↑ Ballanda, Kevin P.; MacGillivray, H. L. (1988). "Kurtosis: A Critical Review". The American Statistician (American Statistical Association) 42 (2): 111–119. doi:10.2307/2684482. http://jstor.org/stable/2684482.

Theory of probability distributions

probability mass function (pmf) · probability density function (pdf) · cumulative distribution function (cdf) · quantile function

raw moment · central moment · mean · variance · standard deviation · skewness · kurtosis · L-moment

moment-generating function (mgf) · characteristic function · probability-generating function (pgf) · cumulant

Statistics

Descriptive statistics

Continuous data

Location	Mean (Arithmetic, Geometric, Harmonic) · Median · Mode

Dispersion	Range · Standard deviation · Coefficient of variation · Percentile · Interquartile range

Shape	Variance · Skewness · Kurtosis · Moments · L-moments

Count data

Index of dispersion

Summary tables

Grouped data · Frequency distribution · Contingency table

Dependence

Pearson product-moment correlation · Rank correlation (Spearman's rho, Kendall's tau) · Partial correlation · Scatter plot

Statistical graphics

Bar chart · Biplot · Box plot · Control chart · Correlogram · Forest plot · Histogram · Q-Q plot · Run chart · Scatter plot · Stemplot · Radar chart

Data collection

Designing studies	Effect size · Standard error · Statistical power · Sample size determination

Survey methodology	Sampling · Stratified sampling · Opinion poll · Questionnaire

Controlled experiment	Design of experiments · Randomized experiment · Random assignment · Replication · Blocking · Regression discontinuity · Optimal design

Uncontrolled studies	Natural experiment · Quasi-experiment · Observational study

Statistical inference

Bayesian inference	Bayesian probability · Prior · Posterior · Credible interval · Bayes factor · Bayesian estimator · Maximum posterior estimator

Frequentist inference	Confidence interval · Hypothesis testing · Sampling distribution · Meta-analysis

Specific tests	Z-test (normal) · Student's t-test · F-test · Chi-square test · Pearson's chi-square · Wald test · Mann–Whitney U · Shapiro–Wilk · Signed-rank · Likelihood-ratio

General estimation	Mean-unbiased · Median-unbiased · Maximum likelihood · Method of moments · Minimum distance · Maximum spacing · Density estimation

Correlation and regression analysis

Correlation	Pearson product-moment correlation · Partial correlation · Confounding variable · Coefficient of determination

Regression analysis	Errors and residuals · Regression model validation · Mixed effects models · Simultaneous equations models

Linear regression	Simple linear regression · Ordinary least squares · General linear model · Bayesian regression

Non-standard predictors	Nonlinear regression · Nonparametric · Semiparametric · Isotonic · Robust

Generalized linear model	Exponential families · Logistic (Bernoulli) · Binomial · Poisson

Formal analyses	Analysis of variance (ANOVA) · Analysis of covariance · Multivariate ANOVA

Data analyses and models for other specific data types


Multivariate statistics	Multivariate regression · Principal components · Factor analysis · Cluster analysis · Copulas

Time series analysis	Decomposition · Trend estimation · Box–Jenkins · ARMA models · Spectral density estimation

Survival analysis	Survival function · Kaplan–Meier · Logrank test · Failure rate · Proportional hazards models · Accelerated failure time model

Categorical data	McNemar's test · Cohen's kappa

Applications

Engineering statistics	Methods engineering · Probabilistic design · Process & Quality control · Reliability · System identification

Environmental statistics	Geostatistics · Climatology

Medical statistics	Epidemiology · Clinical trial · Clinical study design

Social statistics	Actuarial science · Population · Demography · Census · Psychometrics · Official statistics · Crime statistics

Category · Portal · Outline · Index